Text to Point Cloud Localization with Relation-Enhanced Transformer

نویسندگان

چکیده

Automatically localizing a position based on few natural language instructions is essential for future robots to communicate and collaborate with humans. To approach this goal, we focus text-to-point-cloud cross-modal localization problem. Given textual query, it aims identify the described location from city-scale point clouds. The task involves two challenges. 1) In clouds, similar ambient instances may exist in several locations. Searching each huge cloud only as guidance lead less discriminative signals incorrect results. 2) descriptions, hints are provided separately. case, relations among those not explicitly described, leaving difficulties of learning agent itself. alleviate challenges, propose unified Relation-Enhanced Transformer (RET) improve representation discriminability both nature queries. core proposed RET novel Relation-enhanced Self-Attention (RSA) mechanism, which encodes instance (hint)-wise modalities. Moreover, fine-grained matching method further refine predictions subsequent instance-hint stage. Experimental results KITTI360Pose dataset demonstrate that our surpasses previous state-of-the-art by large margins.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Natural scene text localization using edge color signature

Localizing text regions in images taken from natural scenes is one of the challenging problems dueto variations in font, size, color and orientation of text. In this paper, we introduce a new concept socalled Edge Color Signature for localizing text regions in an image. This method is able to localizeboth Farsi and English texts. In the proposed method rst a pyramid using diff...

متن کامل

An Enhanced Viscoplastic Constitutive Model for Semi-Solid Materials to Analyze Shear Localization

Semi-solid materials undergo strain localization and shear band formation as a result of granular nature of semi-solid deformation. In the present study, to analyze the shear localization, a unified viscoplastic constitutive model was developed for the homogeneous flow. Then, a linearized analysis of the stability performed by examining the necessary condition for the perturbation growth. For t...

متن کامل

Computing with Point Cloud Data

Point clouds are one of the most primitive and fundamental manifold representations. A popular source of point clouds are three-dimensional shape acquisition devices such as laser range scanners. Another important field where point clouds are found is the representation of high-dimensional manifolds by samples. With the increasing popularity and very broad applications of this source of data, i...

متن کامل

Point-to-point and Point-to-multipoint CDMA Access Network with Enhanced Security

We propose a network implementation with enhanced security at the physical layer by means of time-hopping CDMA, supporting cryptographically secure point-to-point and point-to-multipoint communication. In particular, we analyze an active star topology optical network implementation capable of supporting 128 simultaneous users up to 20 km apart. The feasibility of the proposed scheme is demonstr...

متن کامل

Simultaneous Determination of Carbazoles in Water Samples by Cloud Point Extraction Coupled to HPLC

Cloud point extraction (CPE) as a rapid, simple and efficient method coupled with high performance liquid chromatography (HPLC) was used for sample preparation and subsequent determination of carbazole, trinitrocarbazole (TrNC) and tetra nitro carbazole (TNC) in water samples. Some effective parameters on extraction, such as volume of Triton X-100, extraction time, extraction temperature, ionic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i2.25347